Maintaining Anti-Monotone Property for Generator with Weight and Its Mining Method
نویسندگان
چکیده
Generator is a concise representation for frequent itemset. And it has the anti-monotone property as the frequent itemset does, which is an important property in real applications. But when itemsets are attached with weights to balance importances between themselves, the anti-monotone property of generator may not hold. Additionally generator with weight may become tough to be dealt with in many circumstances. In this paper, we adapt support weight calculation to generator definition under weight support framework through specific techniques. The anti-monotone property of generator with weight can be kept to facilitate mining works. A new method for mining generators with weights is proposed. It exploits depth-first mining strategy and prunes search space with little cost. Experimental results show that the proposed method runs properly and achieves good performance.
منابع مشابه
Generator of Hypotheses – an Approach of Data Mining Based on Monotone Systems Theory
Generator of hypotheses is a new method for data mining. It makes possible to classify the source data automatically and produces a particular enumeration of patterns. Pattern is an expression (in a certain language) describing facts in a subset of facts. The goal is to describe the source data via patterns and/or IF...THEN rules. Used evaluation criteria are deterministic (not probabilistic). ...
متن کاملEfficient Mining of High Utility Itemsets from Large Datasets
High utility itemsets mining extends frequent pattern mining to discover itemsets in a transaction database with utility values above a given threshold. However, mining high utility itemsets presents a greater challenge than frequent itemset mining, since high utility itemsets lack the anti-monotone property of frequent itemsets. Transaction Weighted Utility (TWU) proposed recently by researche...
متن کاملMining utility-oriented association rules: An efficient approach based on profit and quantity
Association rule mining has been an area of active research in the field of knowledge discovery and numerous algorithms have been developed to this end. Of late, data mining researchers have improved upon the quality of association rule mining for business development by incorporating the influential factors like value (utility), quantity of items sold (weight) and more, for the mining of assoc...
متن کاملScaling up cosine interesting pattern discovery: A depth-first method
This paper presents an efficient algorithm called CosMinert for interesting pattern discovery. The widely used cosine similarity, found to possess the null-invariance property and the anti-cross-support-pattern property, is adopted as the interestingness measure in CosMinert . CosMinert is generally an FP-growth-like depth-first traversal algorithm that rests on an important property of the cos...
متن کاملMethod for Extracting Valuable Common Structures from Heterogeneous Rooted and Labeled Tree Data
The most commonly adopted approach to find valuable information from tree data is to extract frequently occurring subtree patterns. Because mining frequent tree patterns has a wide range of applications such as XML mining, web usage mining, bioinformatics, and network multicast routing, many algorithms have been recently proposed to find the patterns. However, existing tree mining algorithms su...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- JCP
دوره 8 شماره
صفحات -
تاریخ انتشار 2013